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claimed to be the processes for converting one language form to 
another. But there has been little effort to locate these processes 
in the human learner or to identify the nature of the internal codes. 
Computer simulation of reading is useful because the similarities in 
the human reception and perception of orthography and computer input 
allows such study. Computer simulation enables a more detailed study 
of the acquisition of reading skills than is possible in laboratory 
or classroom studies. In previous research a computer program was 
written to convert the word orthographies appearing in beginning 
readers to the segmental phonemes which define their oral 
representation. The computer program "simuread” has now been extended 
to a third grade level of reading proficiency. The segmental phonemes 
are simulated by numerals, which are designated as phoneme 
equivalents. Program output illustrating the word processing is 
included here. (JK) 
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In previous research a computer program was written to convert the word 
orthographies appearing in beginning readers to the segmental phonemes which 
define thoi- oral representation (Lcton, 1969). The computer program, Simuread. 
has nov been extended to a third grade level of reading proficiency. The 
segmental phonemes are simulated by numerals, which are designated as phoneme 
equivalents. Program output, illustrating the word processing, is presented 
In the handout. 

Automated reading, to serve the educational needs of blind persons and to 
remediate the disabilities of reading handicapped, in within the range of pos- 
sibility. There or* a number of basic and technical problems, however, which 
first need to be resolved. An example of a technical problem is the need to 
improve the comprehensibility of artificial speech. An example of a basic 
problem in reading is <:o identify the graphic and oral language determinants of 
segments! and suprasvgmental phonemes in the reading product. 

In the post decade a teaching methodology, ic «; reified aa the linguistic 
approach, has developed in the field of reading. This evolved from Bloomfield's 
(1942) recommendation that a child should be taught the alphabetic principle 
by teaching him the printed equivalents of hia oral vocabulary. Gibson, et al. 
(1962) identified the grapheme- phoneme association aa a learned unit in success- 
ful reading; and Chall (1965), Goodman (1968) and others, have referred to the 
process as coding . 

ML f« eve r claims ray ba made for the valve of learning the alphabetic 
print ip ■ It must alt*o be recognleed that th^ alphabet is the greatest error* 
producing device in the English language. Having 26 symbols, none of which 
cons latently represent a single phoneme, each representing two to eight 
significant variations of sound, Increases the risks of reading errors. The 
fact that each letter character has some phonemic value or values, and that 
most may conditionally have sero phoneme value also produces reading errors. 

It would be Inaccurate to imply that the alphabet behaves in a completely 
unprincipled manner in its relationships with oral language. Successful 
reading refutes such an implication. It is generally recognised that the 
conversion of the printed representation of the language to a corresponding 
oral representation is the essential definition of reading. Unfortunately, 
the views that the alphabetic principle , coding , or that grapheme* phoneme 
associations define this relationship are oversimplified. 

The printed orthography Includes punctuation marks, and logograms as well 
as the letters of the alphabet. Inadequacies of the alphabetic principle are 



illustrated in the following examples of orthography: <we're> is alphabetically 

identical to <were>, <does> is more commonly expressed aa /das/ than aa /dowz/, 
and <read> may convert to /r e d/ or to /r iy d/. The alphabetic principle has 
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little relevance to reading the following examples of orthography <@ 96c per 
doz. « 8c per oz.>, and <% , c/o, #, *fc>. The phoneme chains that are required 
for the numerals <1> and <2> are different from the phoneme chain <12>. 

A grapheme, as It Is traditionally defined la a minimum unit of graphics, 
l.e., each letter of the alphabet and each punctuation mark. Not all graphemes 
however convert to phonemes In oral reading. Some graphemes Indirectly Influence 
other grapheme- phoneme associations, e.g., the <e> In <rate>. The three 
graphemes In <the> convert to two segmental phonemes, and the three graphemes 
In <tax> require four segmental phonemes /t ® k s/ for Its oral representation. 
The /a/ phoneme In /ey b a 1/ Is not designated by any of the graphemes In 
<able>. 

From the field of psycholinguistics the terms coding, encoding, decoding, 
and recoding have been used to refer to the process of changing one language 
representation to another, e.g., aural to speech (Osgood, 1963) and visual to 
oral reading (Goodman, 1968). Chall (1965) recommends that coding be taught as 
a skill for beginning reading. There are erroneous assumptions as well as 
faulty definitions In these theories and teaching recommendations. There are 
at least three separate and Interrelated codes In orthography: alphabet, 

punctuation, and logograms. There are at least two codes In phonemlcs: 
segmental phonemes and suprasegmental phonemes, l.e., stress, pitch and junc- 
ture. Some of the suprasegmental phonemes are determinable from the orthography 
whereas others represent oral patterning. When the term coding Is used to refer 
to the relationship between the printed and oral representations of language It 
Is not a s Ingle, nor a simple process, even for beginning reading. 

The use of the terms encoding and decoding to explain reading and language 
processes may Indicate an effort to utilize communication and Information 
theories. Unfortunately, however, the terms are misused and sometimes defined 
Incorrectly. Both Goodman (p. 17-19) and Osgood (p. 99-100) define decoding as 
a process of extracting meaning from the oral or the graphic representation of 
language, whereas, It Is ordinarily defined as a procedure for converting a 
coded message Into Its prior language representation. Technically, It would be 
more correct to regard both the written and oral representations of language 
as hierarchical systems of codes, rather than as sources of Information with 
meaning. The terms encoding and recoding used by these theorists refer to the 
coding of codes, l.e., of previous symbols and patterns of symbols. Devices 
such as telephone receivers and radio receivers decode patterns of electrical 
and radio waves. A listener may then comprehend, or extract meaning from the 
phonic codes of the oral language, but this Is subsequent to the decoding. 

Although coding and decoding are claimed to be the processes for converting 
one language form to another, there Is little or no effort to locate these 
processes In the human learner, nor to Identify the nature of the Internal 
codes. This Is a shortcoming In these theories* It would be necessary to 
Identify the neuroanatomlcal structures, and the afferent- efferent and brain 
processes which produce the Internal codes. 

To refer to these briefly, there are visual and aural stimuli emanating 
from the graphic and phonic systems. These are transduced by the sense organs 
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and encoded In neuro-electrical impulses which are transmitted along the optic 
and cochlear nerve pathways, respectively. For the production of speech and 
script the efferent pathways transmit coded impulses to the muscles in the end 
organs for these expressions. Decoding, or the association and comprehension 
processes appear to be discrete and integrated cerebral processes, however. 

The perceptual, control and motoral codes are not adequately objectified in the 
existing theories. The dysfunction of these processes may be clinically recog- 
nized in cases of dyslexia, dysphasia, and dysgraphla; but there is not a 
sufficient explanation of them for understanding normal language processing. 

This critique of terminology and theories is not intended to discredit 
the teaching methods for developmental or remedial reading. Rather, a knowledge 
of theory limitations is regarded as necessary for a further understanding of 
the reading process. 

The problem is reconceptualized as 1. the representation of orthography 
in Interrelated graphic codes, 2. the segmentation or decomposition of orthogra- 
phy into graphic signals, 3. their association to phonic units, e.g., segmental 
phonemes, and 4. the modification of the phonemes by suprasegmental phonemes. 

The computer simulation of reading is a double simulation problem. The 
orthography is simulated by holes in data cards and by patterns of electrical 
current in the computer. The holes in the data cards are keyed to represent 
the letters of the alphabet, punctuation, numerals and other logographs on a 
key punch. These holes are converted to electrical currents and are represented 
in on-off states, i.e., they are binary coded. 

There are similarities in the human reception and perception of orthography 
and computer input. The human eye and the key punch can both be regarded as 
transducers. They transfer the orthography from the external environment and 
begin the process of internal coding. The energy in the light waves reflected 
from the orthography stimulate a photo-chemical process in the eye. The photo- 
chemical states are then converted into electrical states which are then 
transmitted on the optic nerve pathways to the occipital cortex. There are 
on-off states in the afferent pathways and in brain nerve cells on which the 
patterned electrical impulses are superimposed. Because of the coordinate 
nerve system, and the cerebellar structures and pathways which enable proprio- 
ception, the human reception of visual information is more complex than the 
computer reception. 

The coding for the oral representation of language emanates in the motor 
cortex. The patterned electrical currents are transmitted on the efferent 
pathways and Innervate the speech muscles. These operate the speech organs, to 
produce the oral expression of the language. The printer or output key punch 
and the speech organs are similar transducers. They transfer the energy from 
the electric current in the internal environment to the printer or the phonetic- 
acoustic code in the external environment. In this project the printer will 
simulate the oral language by printing predesignated numeral symbols for 
segmental phonemes. The human system for language expression is more complex, 
and more flexible than the computer system, but the computer holds a wide 
advantage in its rates for processing and output. 
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The ability to receive coded representations of orthography and the 
ability to produce language output are prerequisites to reading. They are 
characteristic abilities of the child- learner and the computer systems. 

An entertaining question poses itself. Is it easier to teach a computer 
how to read, or is it easier to teach a child how to read? This leads to 
consideration of their Internal abilities. They both present associative and 
memory capabilities to their teachers. The associative capability is necessary 
for relating the elements in the orthography codes to the coding apparatus for 
producing oral language. There is a marked difference in the nature of computer 
memory and human memory as these pertain to reading. In human memory there is 
a two-stage process of immediate recall and long-range memory. There are 
indications also that the visual, aural, tactual and motoral inputs into 
human's memory are processed in afferent and subcortical pathways so that 
certain features, or codes, are selectively enhanced or dampened, prior to 
their integration and inclusion in memeory. In contrast, the computer memory 
is a single state reservoir. Its potential use in artificial reading has not 
been evaluated as yet. 

It may be equally easy, or equally difficult, to teach the computer and 
the child to read. The essential process is the same* The general procedures 
for the computer simulation which have been developed in previous research will 
be briefly described. 

The minimum units of orthography, to which a segmental phoneme, a unitary 
phoneme- combination, or a non-phoneme is associated, are defined as signals. 

The phonemic associations are defined as designates. The alphabet characters, 
punctuation marks, numerals and other logograms are pre-speclfied as acceptable 
input. The computer program: 1) segments the orthography to identify its 

signal units, 2) tests for the existence of conditions in the word environment 
which determine its phoneme designate, and 3) outputs a pre- specified numeral 
equivalent of the phoneme designate. In Phase I of Simuread, i.e., learning 
and processing of primer words, the first occurrence of the signal did not 
require any identification of the environmental context for its initial associ- 
ation. As long as this sign-designate association continued in succeeding 
words the same phoneme output prevailed, and response- generalization form of 
learning was simulated. 

When the signal required another phoneme designate, the conditions in the 
word environment which determined the second designate were identified. Thus, 

8 timulus- discrimination learning was simulated in the program. The association 
to the second designate introduced response-discrimination learning. 

Simuread is an algorithm for reading. The rules for each sign are 
developed as they occur in the texts of the readers. The algorithm is deter- 
ministic; the rules denote the slgn-designates associations. The flow chart 
for Simuread is drawn on 1081 sheets of 8 1/2 x 11" paper, and the program 
contains about 4,000 Fortran statements. The program includes a procedure to 
compute the frequencies of associations and the conditional association matrix. 
This is a row matrix in which each row contains an independent graphic signal 
and each column contains a phoneme designate. Figure 1 in the handout indicates 
the procedures in Simuread. 
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A variation of the Trager and Smith phoneme system is used in this research. 
The phonic units include 33 segmental phonemes, 11 phoneme combinations, one 
juncture (suprasegmental phoneme) and a nonphoneme symbol for a silent grapheme, 
indicated on handout. 

For the purposes of this presentation the following word corpora were 
processed: 1. Dale's list of 769 easy words for the Lorge formula of reading 

difficulty, 2. Houghton Mifflin's first grade readers and 3. Bank Street first 
grade readers. A statistical comparison of the grapheme phoneme associations 
in the Houghton Mifflin and Bank Street readers is presented in table l on the 
handout. The preponderance of similarity in the two matrices is due to the 
similar use of common English words; the differences in the frequencies of 
grapheme* phoneme associations stems from the differences in words and redundan- 
cies. 



Authors and publishers of basic readers vary in their cognizance and 
control of word variables. They also show some awareness to the introduction 
of consonants, vowels and diphthongs in the vocabulary. Statistical analyses 
of the grapheme* phoneme associations which are required at various levels of 
a reading program have not previously been determined. These analyses provide 
basic information to show gradients in the acquisition of reading skills, l.e., 
the correct association of phonemes to the graphic units in words. This in* 
formation may be of value for developing readers in which the grapheme* phoneme 
associations compare to a standard, for example. Standard American oral English. 
Or, the frequency may be controlled to reduce or Increase certain phoneme 
associations for the reading instruction of bilingual or of language handicapped 
children. 

Computer simulation of reading enables a more detailed study of the 
acquisition of reading skills than is possible in laboratory or classroom 
studies. The analyses of the conditional association matrix provides an 
objective means for identifying the learning* processing demands of various 
reading programs. There has been a tendency to grade beginning readers on the 
basis of word* vocabulary rate. Measures of reading difficulty are also based 
on word length and sentence length. The identification of the graphic signals, 
phonemic designates, their redundancies, and the environemental determinants 
of the associations are key variables in defining the acquisition of reading 
skills. 
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Abstract 

Numeric representations of phonemic units : The nine vowels 

/i e s i uu o )/ are assigned numerals 1 through 9; twenty consonants 
/ptkbdgcj f 0 h v <3* s s 2 z l m n q w r y/ are assigned numerals 10 
through 33; /ks/ as one of the representations for <x> is assigned 34; eight 
vowel- semi vowel nuclei /iy ey ay oy uw ow aw/ are assigned 35 through 41; 
the combinations /yu/ and /we/, as minimum representations for <u> as in: 
<use> and for <o> as in <one>, are assigned 42 and 43, the juncture phoneme 
/•f/ is assigned 49, and the non phoneme symbol /0/ is assigned 50. 
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Output 



Pre- specified set of alphabet, 
punctuation and logogram characters 
Pre- specified set of numeric 
representations of phonic units 



data 




Figure It Chart of Simuread Procedures 



Sample A. Seme philosophic statements. Sample B. Words from Lorge.'s list of 769 easy words. 
Sample C. from Houghton Mifflin 1-1 Jack and Janet . Sample D. from Bank Street Reader 1st Grade 
Uptown Oowntoun. 
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ILLUSTRAT 1 OH OF SIMURHAD OUTPUT 
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Illustration of Rows in the Conditional Association Matrix 

Rows extracted from the graphic -phonemic matrix indicating total 
occurences of graphic units and percentages of phoneme associations* 
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Table l 



R^Totals^ M atrices for 1st Grade Readers 





H*M Jock & Janet"" 


I'p & Away Bank St, 


Uptown* Downtown 


Associations: 


22323 


29856 


25755 


Graphs 




Row Totals 




<b> 


90 


13 


8 


<c> 


249 


619 


538 


<ch> 


m 


r> o 


120 


<ck> 


. 286 


176 


63 


<d> 


1370 


1686 


1672 


<£> 


30 


86 


47 


<g> 


338 


453 


413 


<gh> 


10 


36 


65 


<h> 


578 


899 


717 


<J> 


380 


178 


116 


<k> 


481 


390 


472 


<1> 


1065 


1459 


1215 


<le> 


31 


73 . 


112 


<m> 


594 


802 


823 


<n> 


1397 


1881 


1818 


<ng> 


48 


191 


123 


<P> 


350 


579 


400 


<r> 


805 


1412 


1639 


<8> 


1272 • 


1756 


1812 


<sh> 


129 


184 


106 


<t> 


1942 


2453 


2000 


<th> 


664 


913 


1038 


<v> 


131 


-- 


206 


<\f> 


472 


826 


459 


<wh> 


183 


147 


96 


<x> 


29 


. 33 


46 


<y> 


540 


609 


612 


<z> 


m m 


33 


3 


<a> 


1632 


2367 


1984 


<al> 


338 


392 


338 


<ay> 


108 


165 


139 


<au> 




39 


17 


<aw> 


48 


60 


48 


<e> • 


2478 


3208 


2576 


<e*a> 


37 


103 


89 


<ee> 


110 


124 




<ei> 




mb 


19 


<eo> 






17 


<ew> 


19 


11 


27 


<ey> 


17 


92 


80 


<i> 


2028 


1955 


1549 


<le> 


MB MB 


109 


60 


<o> 


1230 


1801 


1429 


<oa> 


14 


29 


22 


<oe> 


19 


19 
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<oi> 
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14 


26 


<oo> 


195 


284 


344 


<ou> 


383 


437 


338 


<ow> 


mb m* 


24 


502 


<oy> 


48 


24 


60 
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154 


406 


345 


<ue> . 
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56 


mb m 


<ul> 
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No tokens for graphs: <ci> <ph.:* <q> <rh> <sci> <si> <ti> <ye> <ae> <eau> 

<eu> <ia> <owa> <ua> <uy> 



